Search CORE

396 research outputs found

Linked Data for the Natural Sciences. Two Use Cases in Chemistry and Biology

Author: Cimiano Philipp
Wiljes Cord
Publication venue
Publication date: 01/01/2012
Field of study

Wiljes C, Cimiano P. Linked Data for the Natural Sciences. Two Use Cases in Chemistry and Biology. In: Proceedings of the Workshop on the Semantic Publishing (SePublica 2012). 2012: 48-59.The Web was designed to improve the way people work together. The Semantic Web extends the Web with a layer of Linked Data that offers new paths for scientific publishing and co-operation. Experimental raw data, released as Linked Data, could be discovered automatically, fostering its reuse and validation by scientists in different contexts and across the boundaries of disciplines. However, the technological barrier for scientists who want to publish and share their research data as Linked Data remains rather high. We present two real-life use cases in the fields of chemistry and biology and outline a general methodology for transforming research data into Linked Data. A key element of our methodology is the role of a scientific data curator, who is proficient in Linked Data technologies and works in close co-operation with the scientist

Publications at Bielefeld University

Learning a semantic parser from spoken utterances

Author: Cimiano Philipp
Gaspers Judith
Publication venue
Publication date: 01/01/2014
Field of study

Gaspers J, Cimiano P. Learning a semantic parser from spoken utterances. In: IEEE International Conference on Acoustics, Speech and Signal Processing. 2014

Publications at Bielefeld University

Ontology-based Information Extraction with SOBA

Author: Buitelaar Paul
Cimiano Philipp
Racioppa Stefania
Siegel Melanie
Publication venue
Publication date: 20/12/2011
Field of study

In this paper we describe SOBA, a sub-component of the SmartWeb multi-modal dialog system. SOBA is a component for ontologybased information extraction from soccer web pages for automatic population of a knowledge base that can be used for domainspecific question answering. SOBA realizes a tight connection between the ontology, knowledge base and the information extraction component. The originality of SOBA is in the fact that it extracts information from heterogeneous sources such as tabular structures, text and image captions in a semantically integrated way. In particular, it stores extracted information in a knowledge base, and in turn uses the knowledge base to interpret and link newly extracted information with respect to already existing entities

Hochschulschriftenserver - Universität Frankfurt am Main

Explicit versus Latent Concept Models for Cross-Language Information Retrieval

Author: Boutilier Craig
Cimiano Philipp
Schultz Antje
Sizov Sergej
Sorg Philipp
Staab Steffen
Publication venue: AAAI Press
Publication date: 01/01/2009
Field of study

Cimiano P, Schultz A, Sizov S, Sorg P, Staab S. Explicit versus Latent Concept Models for Cross-Language Information Retrieval. In: Boutilier C, ed. IJCAI 2009, Proceedings of the 21st International Joint Conference on Artificial Intelligence. Menlo Park, CA: AAAI Press; 2009: 1513-1518

Publications at Bielefeld University

A Systematic Investigation of Blocking Strategies for Real-time Classification of Social Media Content into Events

Author: Cimiano Philipp
Reuter Timo
Publication venue: AAAI Press
Publication date: 01/01/2012
Field of study

Reuter T, Cimiano P. A Systematic Investigation of Blocking Strategies for Real-time Classification of Social Media Content into Events. In: Proceedings of the 6th International Conference on Weblogs and Social Media (ICWSM) - Workshop on Real-Time Analysis and Mining of Social Streams (RAMSS). Palo Alto, California: AAAI Press; 2012.Events play a prominent role in our lives, such that many social media documents describe or are related to some event. Organizing social media documents with respect to events thus seems a promising approach to better manage and organize the ever-increasing amount of user-generated content in social media applications. It would support the navigation of data by events or allow one to get notified about new postings related to the events one is interested in, just to name two applications. A challenge is to automatize this process so that incoming documents can be assigned to their corresponding event without any user intervention. We present a system that is able to classify a stream of social media data into a growing and evolving set of events. In order to scale up to the data sizes and data rates in social media applications, the use of a candidate retrieval or blocking step is crucial to reduce the number of events that are considered as potential candidates to which the incoming data point could belong to. In this paper we present and experimentally compare different blocking strategies along their cost vs. effectiveness tradeoff. We show that using a blocking strategy that selects the 60 closest events with respect to upload time, we reach FMeasures of about 85.1% while being able to process the incoming documents within 32ms on average. We thus provide a principled approach supporting to scale up classification of social media documents into events and to process the incoming stream of documents in real time

Publications at Bielefeld University

Association for the Advancement of Artificial Intelligence: AAAI Publications

Orthonormal Explicit Topic Analysis for Cross-lingual Document Matching

Author: Cimiano Philipp
Klinger Roman
McCrae John
Publication venue
Publication date: 01/01/2013
Field of study

McCrae J, Cimiano P, Klinger R. Orthonormal Explicit Topic Analysis for Cross-lingual Document Matching. In: Proceedings of the 2013 Conference on Empirical Natural Language Processing. 2013: 1732-1740

Publications at Bielefeld University

Human Activity Classification with Online Growing Neural Gas

Author: Beyer Oliver
Cimiano Philipp
Panzner Maximilian
Publication venue
Publication date: 01/01/2013
Field of study

Panzner M, Beyer O, Cimiano P. Human Activity Classification with Online Growing Neural Gas. In: Workshop on New Challenges in Neural Computation (NC2). 2013: 106-113.In this paper we present an online approach to human ac- tivity classification based on Online Growing Neural Gas (OGNG). In contrast to state-of-the-art approaches that perform training in an offline fashion, our approach is online in the sense that it circumvents the need to store any training examples, processing the data on the fly and in one pass. The approach is thus particularly suitable in life-long learning settings where never-ending streams of data arise. We propose an archi- tecture that consists of two layers, allowing the storage of human actions in a more memory efficient structure. While the first layer (feature map) dynamically clusters Space-Time Interest Points (STIP) and serves as basis for the creation of histogram-based signatures of human actions, the second layer (class map) builds a classification model that relies on these human action signatures. We present experimental results on the KTH activity dataset showing that our approach has comparable per- formance to a Support Vector Machine (SVM) while performing online and avoiding to store examples explicitly

Publications at Bielefeld University